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Abstract — In peer-to-peer networks, free riding is a ma- 
jor problem. Reputation management systems can be used 
to overcome this problem. Reputation estimation methods 
generally do not considers the uncertainties in the inputs. 
We propose a reputation estimation method using BLUE 
(Best Linear Unbiased estimator) estimator that consider 
uncertainties in the input variables. 

Reputation aggregation in peer to peer networks is gener- 
ally a very time and resource consuming process. Moreover, 
these methods consider that reputation of a node is same 
for every other node in the network, while this is not true. 
This paper also proposes a reputation aggregation algorithm 
that uses a variant of gossip algorithm called differential 
gossip. Differential gossip is fast and requires less amount of 
resources. This mechanism allows computation of different 
reputation value of a node for every other node in the 
network. We have implemented Differential Gossip Trust for 
power law network formed using PA Model. The reputation 
computed using differential gossip trust shows good immu- 
nity to collusion. We have verified the performance of the 
algorithm on power law networks of different sizes ranging 
from 100 nodes to 50,000 nodes. 

I. Introduction 

Peer-to-peer systems have attracted considerable at- 
tention in recent past as these systems are more scalable 
then client server systems. But distributed nature of peer- 
to-peer networks brings many challenges for system 
designers. These networks are designed keeping in mind 
that every node is honest and co operative. It means, if 
some node takes some resource from the community, it 
will also facilitate the community. 

But nodes are the entities operated by rational human 
beings. So nodes are expected to behave in a selfish 
manner i.e. they try to maximise their utility. This results 
in their non co-operative behaviour. This phenomenon is 
explained by famous Prisoners Dilemma, in which Nash 
Equilibrium (NE) is achieved when both prisoners de- 
ceive each other [1J. Similarly in a file sharing network, 
if nodes are considered as players, their NE happens 
when none of them share the resources [2]. Tendency 
of nodes to draw resources from the network and not 
giving any thing in return is termed as 'Free Riding'. 
An Experimental study) 3] on Gnutella network in 2005 



also confirmed this fact by showing that number of free 
rider nodes is as high as 85%. 

Such type of problems also exist in e-commerce sys- 
tems like e-bay. In e-commerce portals like e-bay, people 
sell and buy different things online. Buyers and sellers 
generally do not know each other. So the possibility 
of cheating or possibility of providing a product or 
service of inferior quality always exists. To avoid this, 
e-bay uses a rating based reputation system. After every 
transaction, user gives a feedback rating to his counter 
part and based on these ratings, reputation is decided. 
This reputation helps users in making decision about 
transactions l¥l . 

Trust or reputation management systems can also be 
used in peer to peer networks to overcome the problem 
of free riding as well as to ward off some of the attacks. 
In a peer to peer file sharing network, trust or reputation 
of a node represents its co-operative behaviour towards 
other nodes. A node seeking some resource from another 
node measures the ratio of received resource to the 
requested resource after every transaction and uses it 
to update the trust value. If a node imparts all the 
requested resource, it's reputation is considered as one 
and if a node always declines sharing of resources, it has 
reputation as zero. 

The advantage with e-bay like systems is that these 
have a central server which keeps all the feedback rating 
and reputation related data. So, if a user wants to check 
any other user's reputation, he just asks the central 
server and receives authentic information. In peer to 
peer file sharing networks, as there is no central server, 
trust has to be estimated and stored by each node in a 
distributed fashion. 

Good methods for trust estimation and trust aggrega- 
tion are needed to design a good reputation management 
system. Trust can be estimated in a very simple way 
as the ratio of received to requested resources but this 
simple method can not overcome the effect of noise (i.e. 
uncertainty) in the estimation of trust value. We are 
proposing a trust estimation method using BLUE (Best 
Linear Unbiased Estimator) |28|. This method overcomes 
the noise effects considerably and requires almost same 



2 



amount of memory and computation. 

Aggregation of trust generally consumes a lot of time 
and memory. It becomes even more difficult when num- 
ber of nodes is large. Moreover, existing methods assume 
that the reputation of a peer must have a global value, 
i.e. peers behave uniformly with all other peers, but this 
is not true at least in the case of selfish nodes. This 
paper proposes a good scalable method using a varia- 
tion of gossip algorithm [30 j which generates different 
trust vectors at different nodes. In computation of these 
vectors, global trust values are also taken into account. 
This algorithm has been shown to work with networks 
having power law degree distribution, i.e. networks 
formed by Preferential Attachment (PA) model. 

Remainder of this paper is organised as follows: Sec- 
tion two discuses the related work in reputation manage- 
ment while section three describes the system model. In 
section four the estimator for trust using BLUE is de- 
rived and section five proposes differential gossip trust 
algorithms for aggregation of trust. Section six presents 
the numerical results and section seven concludes the 
paper. 

II. Related Work 

Different authors have used different methods for 
estimation of trust value of a node. It is also termed 
as the contribution from the node. Buragohain et.al. O 
and Wang et.al. |6| take the ratio of resource contributed 
by the node to the ratio of absolute measure of contri- 
bution, multiplied by the cost of resource. Buragohain 
et.al. does not discuss the mechnism to measure of 
contributions of a node by the receiving node. In (7| 
receiving node computes the trust value of a node on 
the basis of received data in the transactions with the 
sending node. In [7], quality of transaction is sent to 
service provider node in form of a cookie. Duttay et.al. 
JSJ and Papaioannou et.al. ||9j suggest that each node 
should calculate reputation of other node on the basis of 
service received from the other nodes depending upon 
number of transactions done with those nodes, delay in 
the transactions and the download speed. Andrade et.al. 
[10 1 calculates the reputation of a node by taking the 
difference of resources received from and provided to 
the node. In Jni,lll2,llll,lll6l and EL a node adjusts 
the reputation of other node on the basis of quality of 
transactions with that node. Eigen-Trust [12J uses sum of 
positive and negative ratings, Peer-Trust |13| normalises 
the rating on each transaction whereas Power-Trust [16] 
uses Bayesian approach to calculate reputation locally. 

Mengshu et.al. 1 18 1 takes the ratio of successful trans- 
actions to total transactions. PET |15| classifies the ser- 
vice quality into four types (every type provides some 
score), then calculates ratio of score obtained by a node 
to total score it could have got. After calculating this 
ratio, it is combined with feedback obtained from dif- 
ferent nodes and final reputation is calculated. Banerjee 
et.al. 1 19 1 calculates reputation of a node for a particular 



type of resource by taking the ratio of number of times 
this type of resource was served by that node to the 
number of times this type of resource was asked from 
that node. In Fuzzy-Trust |l4l . 11201 , nodes do fuzzy 
inference on parameters to calculate the trust score locally 
for another node 'x' and then aggregate it with trust 
scores of the node 'x' as received from other nodes using 
their weights. Peer can also give rank to the interacting 
nodes on the basis of normalized download volume 
received from them during a fixed period El . The rank 
of an interacting node is computed using a function 
that increases when the node provides a resource and 
decreases when a resource is allocated to the node |22|. 
In Il23l , peer maintains a binary vector of m bits. After 
a transaction happens one or zero is added at most 
significant position of the vector, after shifting right all 
the previously placed bits by one place. This trust vector 
is considered as a m bit binary number. To compute the 
trust, value of the m bit binary number is divided by 2 m . 
This ensures that the trust value lies in between and 
1. In [24 1, peer takes the ratio of the total contribution 
received from a node to the sum of total contribution 
received from that node and total allocation done by 
the peer for that node. In |25|, peer calculates the ratio 
of the sum of resource it has received from a node to 
the resource it has requested to that node for last ten 
transactions. Moon et.al. 11261 classifies the services from 
a node and assigns different weights to different services, 
e.g. weight for file transfer is 0.4, weight for query 
response is 0.2 etc.. Peer gives score to the interacting 
node accordingly. 

Many methods JEJ, (131, El, OH and El have been 
proposed for reputation aggregation in the literature. 
Eigen-Trust [12] depends largely on pre-trusted peers 
i.e. peers that are globally trusted, this is scalable to 
a limited extent. Peer-Trust [13] stores the trust data 
(i.e. trust values of all the peers in the network) in 
a distributed fashion. This is performed using a trust 
manager at every node. In Peer-Trust, hash value of 
a node id is calculated to identify the peer where the 
trust value of node will be stored. Song et.al. [14J uses 
fuzzy inference to compute the aggregation weights. In 
Fuzzy-Trust, each peer maintains the local trust value 
and transaction history of the remote peer. At the time 
of aggregation each peer asks for the trust value from 
the qualified peers and combine the received values and 
locally existing values to compute updated trust values. 
Power trust [16J depends largely on power nodes. Power 
nodes are few top reputation nodes in the network. It 
uses score manager like trust manager in Peer Trust and 
Look ahead Random Walk for aggregation. Gossip Trust 
ITTll uses gossip algorithm for aggregation. 

III. System Model 

In this paper we are studying a peer-to-peer network. 
There is no dedicated server in this network. Peers in this 
network are rational, i.e. they are only interested in their 
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own welfare. They are connected to each other by an 
access link followed by a back bone link and then again 
by an access link to the second node.We are assuming 
that the network is heavily loaded i.e. every peer has 
sufficient number of pending download requests, hence 
these peers are contending for the available transmission 
capacity. We also assume that every peer is paying the 
cost of access link as per the use. So, every peer wants 
to maximises it's download and minimise it's upload so 
that it can get maximum utility of it's spending and this 
leads to problem of free riding. 

If a node is downloading, some other node has to 
upload. So the desired condition is, download should 
be equal to upload for a node. Usually this means 
that there is no gain. Even in this scenario the node 
gains due to interaction with others, chance of survival 
increases. Thus interaction itself is an incentive. A node 
will usually try to get the content and avoid uploading it 
as this maximises gain for it. Thus free riding becomes 
optimal strategy. So a reputation management system 
need to be enforced to safeguard the interest of every 
node by controlling free riding. 

In reputation management system, every node main- 
tains a reputation table. In this table, the node maintains 
the reputations of the nodes with whom it has interacted. 
Whenever it receives a resource from some node, it 
adjusts the reputation of that node accordingly. When a 
node asks for the resource from this node, it checks the 
reputation table and according to the reputation value of 
requesting node, it allocates resource to that node. This 
ensures that every node is facilitated from the network 
as per its contribution to the network and consequently 
free riding will be discouraged. 

For using such a reputation management system node 
needs to estimate the trust value of the nodes interacting 
with it. Trust value observed by node i for node j in 
transaction k can be defined as 
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Here R k . represents the amount of resources requested 
by node i from node j for k transaction; Z.. represents 
the amount of resource received by node i from node j 
in k th transaction. 

In estimation of trust value by such methods few 
important points are missed out. These points are as 
follows. 

1) In p2p networks, when a peer asks for some re- 
source, it is not guaranteed that it will get the asked 
resource. So, generally peer asks for larger amount 
of resource than needed and when it is offered 
more resource then its requirement, it refuses the 
extra offered resource. Also it does not give any 
credit for this extra offer. 

2) Once requesting node decides about the node from 
which it is going to take data, both of the nodes 



decide about the rate of data according to their 
upload and download capacities. But at the some 
time underlying network may not able to provide 
this kind of data rate because of congestion in the 
network at different routers. So, even if service 
provider node is trying to give data at the commit- 
ted rate to the requesting node, it can not give and 
this affects the reputation assignment as reputation 
is assigned based on the actual data rate. 
3) If some requested node has already got too many 
requests, it will not be able to provide the quality 
of service that it could have provided with lesser 
load. Hence the requesting node needs to estimate 
the reputation of service provider node considering 
the load and previous transactions with that node. 
To resolve above issues, the trust value should be calcu- 
lated according to the following method: if node j offered 
the data against the request of node i and it didn't accept 
the offer. 

kj = 5, (2) 

Here 5 should be upper bounded in such a way that no 
node can play game for its benefit. It means if value of 5 
will be too high, node will made an offer even if it don't 
want to serve and on asking for resource it will deny. So 
node will upper bound it by the ratio of its download 
capacity to the total requests made by the node. 
Whereas if node i accepts the offer 

actual service rate willing service rate 



^ feasible service rate * requested service rate ^ 

Here actual service rate is the average rate at which 
receiver received the data and feasible service rate is 
the rate at which the TCP Reno algorithm can get the 
throughput via underlying link with packet loss proba- 
bility p. This rate can be computed using the expression 
given in E711 . 



(1) B(p) 
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KTT RTT^ + T -minll,3^f\p(l+32p 2 ) 

(4) 

Here B is feasible service rate as a function of packet 
loss probability p; W„,„ x is the maximum window opened 
by receiver, RTT is the round trip time between two 
nodes; To is timeout period and b is number of packets 
acknowledged by single a acknowledgement. 

Ratio of willing service rate and requested service rate 
(let's call it A) needs to be estimated on the basis of 
observed samples of the ratio of offered service rate and 
requested service rate (let's call it x[n], where n is time 
instant). Let the model for the estimation of A be 



x[n] = A - w[n], 



(5) 



Here x[n] is the observed value for a node in a particular 
transaction, A is the exact value and w[n] is the noise 
in estimating the exact value. Let us assume that A is 
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constant and mean value of w[k] is W and the ratio of 
W and A is C 



C 



I Cixc 2 ' 
|0, 



if Ci x C 2 > 1. 
otherwise. 



(6) 



Here 
Ci = 

c 2 = 



(7) 



requests made by the node 
download capacity of the node 
total capacity shared by the nodes in the network 
total requests made by nodes in the network 

Here C\ is the ratio of the total request made by 
that node and download capacity of the node i.e. how 
much node over requested, and C 2 is the ratio of total 
capacity shared by all the nodes and total requests made 
in the network by all the nodes. C 2 is statistically the 
average offer against unit request by a node in the 
network. Multiplying C 2 with C\ will give us the average 
offer to the node against its capacity. Uncertainty have 
identical distribution for every sample and samples are 
independent, i.e. the noise samples are i.i.d.. Hence every 
sample will have same variance. Let us assume this 
variance be a. 



IV. Estimation of Trust 

Based on observed values of trust, exact trust value 
can be estimated using some estimator. We have used 
Best Linear Unbiased Estimator (BLUE) |28J for this 
purpose. Taking expectation of x[k] in eqn( |5]l 



E[x[k]] = E[A] - E[w[k]], 
= A-W. 



so the scaled mean will be 



S[k] 



E[x[k]] 




(9) 
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where is the parameter to be estimated i.e. A in this 

case, so 



S[k] = 



A-W 



(11) 
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so 



S = 1 
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The covariance matrix 



C = 



a 2 

a 2 

a 2 



NxN 



C = CT 2 I 



NxN 



(13) 



(14) 



As we know that the BLUEI28I is 



A = 



S'C^X 
S'C^S 



(15) 



substituting the values, 



sInxn' 



l-f)l^lNxN(l-f)l 



(16) 



solving, we get 



A = 



N 
k=l 



(17) 



1 _ n. 

1 A 

x[k] 
~ 1 _ vv 

1 A 

So, we can see that we need to compute the sample 
mean of all samples and the ratio of noise mean and 
parameter mean. 

This formulation have two problems, first it is difficult 
to compute the sample mean when the number of sam- 
ples is large and second, if value of a is changing after 
some time, this estimator doesn't consider that change. 
So, we can use exponential moving average of samples 
i.e. 

x[k] = a • x[k] + (1 - a) ■ x[k - 1] (18) 

Here a depends on the rate of change in behaviour of 
nodes. We have evaluated the performance for three 
different values of a i.e. 0.1,0.01 and 0.001 whereas 
x[i] =x[l]. 

Value of C 2 for complete network is difficult to be 
found out, so node will periodically ask high degree 
node(s) near to it. The high degree node(s) will generate 
this data as per requests and offers from its neighbouring 
nodes. 

V. Aggregation of Trust 

Typically there will be millions of nodes in a peer-to- 
peer network. These nodes are connected by a network 
graph generated by PA process G„ [32] for m > 2. 
Generally nodes will have small number of neighbours. 
Here by neighbour, we mean those nodes that have their 
address stored with the particular node. 

Whenever a node needs a resource, it asks from its 
neighbours; if they have the resource, it gets the resource. 
If neighbours do not have it, they forward the query 
to their neighbours and so on. The node that have 
required resource, replies back to requesting node and 
then requesting node asks for the resource from that 
node. The answering node provides the resource to that 
node according to the reputation of that node. 

If a node receives a request from another node that 
is not its neighbour, some estimate reputation of that 
node needs to be estimated some how in order to decide 
the quality of service to be provisioned to this node. If 
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two nodes are going to transect for the first time they 
should have reputation of the each other with them. 
This can be done by getting the reputation of node from 
neighbours and using it to make an initial estimation. 
When for a node, multiple trust values are received, 
we need an aggregation mechanism to get single trust 
value. Normalisation of trust value will be integral part 
of aggregation mechanism as trust value always lies in 
between zero and one. 

Thus trust values will be propagated and aggregated 
by all the nodes in a network. For the whole network we 
can define a trust matrix T, a matrix of dimensions NxN. 
Here T,j represents the trust value of i as maintained by 
j. This matrix is generally sparse in nature as generally 
a node will have very small number of neighbours as 
compared to total number of nodes in the network. It 
may be noted that T;; is estimated based on transaction 
between nodes i and j and can be called as local trust 
value. Thus the trust estimate will be based on aggre- 
gation of local trust values and trust estimates received 
from neighbours. To convert this sparse matrix into a 
normal matrix is basically reputation aggregation. 

Generally all of these approaches [12|,|13J,|16j and 
llT7l assume that reputation of a peer must have a 
global value, i.e. peers behave uniformly with all peers. 
But this is not the case at least in case of selfishness. 
Peer behaves with different decency levels with different 
peers. The same holds for the opinion as well, i.e. peer 
gives different weights to different opinions of different 
peers. These aspects have been taken into account in our 
algorithm called Differential Gossip Trust. 

A. Differential Gossip Trust 

The algorithm can be divided into two parts. In first 
part we will discuss about the method of information 
diffusion whereas in second part we will discuss about 
the information that is to be diffused. 

1) Differential Gossip Algorithm: Gossip Algorithms are 
used for information spreading in large decentralised 
networks. These algorithms are generally simple, light 
weighted and robust for errors. They have less overhead 
compared to deterministic algorithms [30|,[29j. Gossip 
algorithms are also used for distributed computation like 
taking average of numbers stored at different nodes. So 
these type of algorithms are suitable for computation of 
reputation vector in peer-to-peer networks as used in 

ma. 

There are three types of gossip algorithms: push, pull 
and push-pull. In push kind of algorithms node chooses 
a node in every gossip step among its neighbours ran- 
domly and pushes its information to that node whereas 
in pull algorithms node takes the information from the 
randomly selected node. Both of these processes happen 
simultaneously in push-pull algorithms. 

In [13| push gossip algorithm given in 1130] is used. 
This algorithm considers that network is a complete 
graph but the peer-to-peer networks are generally based 



on Preferential Attachment(FA) model |31],[32J. Network 
based on PA model, can be divided in to two types 
of nodes [33 1 viz. power nodes and low degree nodes. 
Power nodes are the nodes that have high degree and 
their degree is of the order of N e . The degree of low 
degree node is of the order of log N. Low degree nodes 
form a linear sub graph within the network and this 
linear sub graph has diameter of 0(log N). Here N is 
the number of nodes in the network and e is a constant 
greater then zero. 

Chierichetti et.al. 11331 says that in PA model based 
networks, push or pull alone cannot spread the infor- 
mation with a reasonably fast speed. If push model 
is implemented and the information is with a power 
node, it will take many rounds in pushing information 
to a low degree node. If pull model is being used, and 
the information is with low degree node, it will again 
take many rounds to pull the information by power 
node. This phenomenon will be more evident in average 
computation using push gossip algorithm or pull gossip 
algorithm as information with every node needs to be 
distributed to every other node. 

To avoid this problem we propose differential push 
gossip algorithm. In this algorithm every node makes 
different number of pushes in a single gossipping step. 
We will discuss algorithm in four phases. In first phase, 
global reputation aggregation for a single node will be 
discussed and in second phase, we will discuss globally 
calibrated local reputation aggregation for a single node. 
In third phase, global reputation aggregation for all 
nodes will be discussed, and finally in forth phase glob- 
ally calibrated local reputation aggregation for all nodes 
will be discussed. Here, we make three assumptions, 
first, every node has a unique identification number that 
is known to every other node. So, if some node pushes 
some information about some other node, receiving node 
knows that this information is about which particular 
node. Second, time is discrete and third, every node 
knows about the start and end of gossip step. 

In the first phase, all nodes gossip their feedback 
about a single node to find its global reputation. Let 
us represent the feedback from the i th node by y ( . Now 
every node assumes a gossip weight (gi) as one. Now 
every node pushes its degree to the node connected to 
it. After this all nodes calculate the average degree of 
their neighbours. Every node has two information to 
gossip, y, and gi. Let us call this pair as gossip pair. 
The ratio of gossip pair is stored. Now every node, first 
calculates the ratio (k) of its degree and average degree 
of its neighbours. Now it chooses k nodes randomly in 
its neighbourhood and sends (k + l) th part of gossip pair 
(jtt yi/ k^igd to ai l k nodes and itself. 

After receiving all gossip pairs from different nodes 
including itself, node sums up all the pairs. This sum- 
mation now becomes new gossip pair. Now the ratio of 
this gossip pair is taken and if difference of this gossip 
pair and previous gossip pair is less then some prede- 
fined small constant, we assume that the algorithm has 
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converged otherwise same process keeps on repeating. 

2) Differential Reputation Aggregation: Let us consider 
that there are N nodes in the network. Every node 
periodically calculates the trust value of other node on 
the basis of quality of service provided by that node 
against the request. Node normalises these trust values. 
If Tij is the trust value measured by node i for node j, 
then normalised trust value f , ; of the node j for the node 
i will be 

i 

These tjj (1 < i,j < N) will form a normalised trust 
matrix t. The matrix t will be stochastic in nature as 
- 1- 11 a node has not transected with any nodes 
its trust value will remain zero. This value is taken as 
zero to avoid the white washing attack. This value can 
be taken higher then zero and then it can be dynamically 
adjusted as per the level of whitewashing in the network. 
But in this paper we have not implemented that. 

Although each node considers the feedback given by 
every node in the network by averaging with equal 
weights, it is desirable to assign different weights to the 
feedback of different nodes, it has already transacted 
with. These weights can be assigned on the basis of 
number and quality of transactions done with other 
nodes and trust value of a node represents these two 
things. So weights for different nodes can be derived on 
the basis of trust values of these nodes. We propose the 
weight Wij as follows: 

Wij = a M << (20) 

Where a and b are two parameters that a node can 
decide on its own. First parameter will be adjusted 
according to the overall quality of service received by 
the node from the network, whereas second parameter 
will be adjusted according to the recommendation of 
the particular neighbour and quality of service from the 
network. So second parameter will be adjusted for every 
neighbour differently. But for now we have taken a and 
b constant for every node. This kind of weight will have 
fallowing features: 

. Even if a node has no neighbourhood relation with 
the estimating node, it's feed back will still get some 
consideration. 

• If a node has bad reputation with estimating node, 
it's feedback will have weight close to the node 
which have no neighbourhood relation with the 
estimating node. 

• Nodes with higher reputation will be given higher 
weights and it will help in making better quality of 
service groups. 

• Values of a and b can be dynamically adjusted by 
nodes as per their requirement. 

A weighted trust matrix, that is different at every node, 
is formed by multiplying trust values with weights i.e. 



for node I, the ij element in weighted matrix will be Wuj 
such that, 

W Uj = w Hj x t^ (21) 

Every node performs two actions in process of reputa- 
tion aggregation. At first a node calculates the global rep- 
utation vector Rgiobai and secondly it gives high weight 
to the feedback given by nodes according to the quality 
of service provided by. This leads to the calculation of 
globally calibrated local reputation vector. The globally 
calibrated local reputation vector is a collection of the 
reputations of all the nodes in the network according to 
received feedback about the node under consideration 
and the weight of feed backing node for calculating 
node. It means if some node I is calculating globally 
calibrated local reputation vector, the i th element of this 
vector will be 

'V-£lV /;; (22) 

i 

To calculate the global reputation vector Rgiobai/ fol- 
lowing matrix-vector computations can be used, 

Rglobal(« + 1) = ^(t T («) X 1 1XN ) (23) 

Here n is the time instant and r, is global reputation of 

ith node and 1i x n is a vector of N l's, i.e. [Ill 1 ]. 

Gossip algorithm is used for doing this computation in 
distributed fashion. 

Now globally calibrated local reputation at node I, Ri 
is calculated using the matrix vector multiplication 

R i(" + 1) = rr^ ; — r-TT tt(wT(m) x l lxN ) 

N * (total neighbour weight + 1) 

(24) 

In Second phase we want to compute the globally 
calibrated local reputation at each node for a node say X. 
As in gossip algorithm, after every step, the value at the 
node keeps on changing as it gets added to the values 
pushed by other nodes. So after few steps it is difficult 
to identify the initial value it had. 

Based on these facts we can modify our algorithm. 
If a node is participating in the process of gossip for 
the first time about node X or reputation of node X at 
this node has changed considerably since last gossip, this 
node will push the reputation to all of its neighbours in 
the zeroth step. Otherwise, this node will not push any 
information in this step. This process will be done by 
all nodes. After this process every node has opinion of 
its neighbours about node X(If a node makes no push, 
its push in last gossip process will be considered). Now 
these reputations will be multiplied by (senders weight- 
1) and summed up as value y. Now normal gossip will 
be done as in algorithm 1. This will lead to a global 
reputation value y. Now y and y will be added to get 
the final value. 

In third phase we want to aggregate the global rep- 
utation of all nodes simultaneously. This algorithm is 
quite similar to algorithm 1 except few changes. Unlike 
algorithm 1, node will push complete vector y, which 
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Algorithm 1 Global Reputation aggregation for a single 

node 

Require: Feedback for the node j by node i y,, gossip 

error tolerance f, 
Ensure: Global Reputation of node j (Ry) 

l: Assume weight i.e. gi = 1 

2: Push self degree to neighbouring node 

3: Take the average of neighbour degree 

4: Calculate the ratio of its degree and average of 
neighbour degree (*,• <- avera JZglbLr c _ 

5: Round off k, for fc; > 1 else take fc, = 1 



) 



m <- 1 {Initialise Gossip Step} 



u 

Si 

repeat 

let {y r ,gr) be all gossip pairs sent to i in previous 
step 

10: y, <- E i/r; gi <- E gr {update gossip pairs} 

r r 

11: choose fc; random nodes in its neighbourhood 
12: send gossip pair (jr^y;, j^gO) to an nodes and 



13 
14: 
15 
16 
17: 
18 
19 
20: 
21 



himself 

m *- m + 1 {increment the gossip step} 
if r * then 

end if 

if | - m < I then 

Increment converged Step(cs) by 1 
end if 
unti l cs < 5fWi 

output I 1 



consists of feedback from node about all other nodes it 
has transected with. Similarly instead of single gossip 
weight g ir node will send vector g t . A node id will also 
be attached with every pair of y, and gi so that receiving 
node can distinguish among gossip pairs. So, in fact, 
node pushes gossip trio consisting of y„ g, and node id. 
The convergence of algorithm is checked by following 
condition: 







fyKn-in 


\gi(n)J 




U(«-l)j 



<4 



(25) 



Where n is the time instant and £, is the permissible 
error bound. 

In the forth phase we want to aggregate the globally 
calibrated local reputation of all nodes simultaneously. 
So same kind of changes will made in algorithm 2, as we 
made in algorithm 1 for third phase. One extra thing that 
will be done at the end of algorithm is normalisation, 
i.e. once every node gets it's globally calibrated local 
reputation vector, it will normalise the vector. 

B. Analysis of Algorithm 

In this section we will study the time that will be 
required to converge the nodes on the average of values 
stored at different nodes. For that, first we will study 



Algorithm 2 Globally calibrated local Reputation aggre- 
gation for a single node 

Require: Feedback matrix t, gossip error tolerance £, 
Ensure: Globally calibrated local Reputation of node j 

(Tij) 
l: Node i do 

2: y ; *- tij {n is the time instant} 

3: calculate Wj for all neighbours of i by formula Wj = 

a u 'i 

4: Assign weight to nodes i.e. gi = 1 
5: if Node i is participating first time in gossipping 
process then 

6: Push feedback about the node under consideration 

to all neighbours 
7: else 

8: if Feedback about the node under consideration 
has changed by more then some constant A then 

9: Push the new feedback to all the neighbours 
10: end if 
11: end if 

12: Push self degree d,to neighbouring node 
13: for k=l to dj do 

14: Calculate y,- <- (w k - 1) x feedback from node k 
15: end for 

16: Take the average of neighbour degree 
17: Calculate the ratio of its degree and average of 
neighbour degree (*,■ <- am JZghbour degree ) 



18 
19 
20 
21 
22: 
23 
24: 



25 
26: 

27: 
28 

29 
30: 
31 
32: 
33 
34: 
35 
36: 



Round off kj for fc; > 1 else take kj = 1 
End do 
m <- 1 and u 
repeat 



{Initialise Gossip Step} 



Do for node i 



let (y r , g r ) be all gossip pairs sent to i in previous 
step 

pi <- y,- + j- x (wij - 1) (j is pushing node) 
Vi <~ L y/,gi <- E gr {update gossip pairs} 

r r 

choose ki random nodes in its neighbourhood 
send gossip pair (^y;, ^Pigi)) to all fc,- nodes and 
himself 

m *- m + 1 {increment the gossip step} 
if r + then 

end if 

if | - u < I then 

Increment converged Step(cs) by 1 
end if 
until cs < °^fl 

37: n, 



''I L^+l 
38: output r, y 
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the spreading of gossip in power law network, then we 
will study the diffusion speed of gossip in power law 
network and then based on these results, we will find 
the time of convergence. 

Theorem V.l. Gossip will spread with high probability in a 
PA based graph, {G™} for m>2, within 0(log 2 N) time using 
differential-push (by high probability we mean, l-o(l), where 
o(l) goes to zero as N increases). 

Proof: Nodes in {G 1 ^} can be classified in to three 
types [33 1 . First type is high degree nodes (i.e. power 
nodes), let us call this type as H. Second type is low 
degree nodes(with degree OQogN)) joined in the first half 
of the process, let us call this type as W and third type 
is low degree nodes joined after W, lets call this type as 
V. 

Chierichetti et.al. |33| proved that if some gossip orig- 
inates at node in V or W, it will reach within 0(log 2 N) 
time to every node with high probability. The problem 
arises when gossip originates in H as, nodes in H has 
high degree. So, if a node in W or V is only connected to 
a high degree node, probability of spread of information 
to that node is low, as high degree node is making 
only one push to a randomly chosen node in one time 
slot. So, if a high degree node makes k pushes to k 
randomly chosen nodes in one time slot, where k is 
ratio of the degree of high degree node to OQogN) or 
alternatively the average degree of its neighbours, this 
problem will not be there and gossip originated in H 
will also reach within OQog 2 N) time to every node with 
high probability. ■ 

Suppose nodes start gossip at n = and the number 
to be averaged at i th node is do,; whereas after n steps of 
gossipping the number is d„ i and similarly gossip weight 
after n steps of gossipping at i tb node is g n j. If in process 
of gossipping a node j chooses node i and pushes, then 
the contribution by j to i will be c„ ; ;.-. So it can be said 
that d n j = Yjj c nfii j ■ d 0/ „ and g nJ = c n ,i,j- 

Theorem V.2. Uniform Gossip diffuses with differential push 
in PA based graph within 0(log 2 N + logh) time with high 
probability such that contributions at all nodes will be £, 
uniform after this amount of time, i.e. maxj\^^ — ^| < £ 

the 



every node is making p pushes to p different nodes 



We can see 
Hon (proposition 
as well. 



V.3i 



property 
from J3U 



of mass 
holds in 



conserva- 
this case 



Proposition V.3. Under the protocol differential Push with 
Uniform Gossip, at any time n, the sum of all of js contribu- 
tions Y,i c n,i,j and hence the sum of all weights is at all nodes 
i is Lign,, =N 

proof of theorem [K2] We first need to study a function 
\\)„. Here if>„ is defined as 



4>n 



(26) 



V + 1 r n ' y N ) Lu „ + 1 \ Cn ' k 'i n ) 



k:f(k)=i ' 



(27) 



(p + 1)2 E ( c *'W n' ) + (p + 1)2 E E ( c "- k -> g N ) 

' i,j ' i,) k:f(k)=i 



■71,1,1 



§n,i\( gnjc\ 



(p + VfJ-k L. 

yr ' i,] k:f(k)=i 

2 y y / gn,k\( gn,k\ 

(p + lf La La \ Cn M N )\° n ^ N) 
hi m-.f(k)=f(k)=i 

1 , 2 y / _gnAl _gnA\ 

{ V + 1) + (p + 1)2 . . Ll . r« n ) \ Cn ' K > n ) 

h],k:f(k)=i 

2 y y / _8n*\( 8„,k\ 

+ 1)2 La La \ Cn ^ N ){ C "^ N) 

j,k Uk:f(k)=f(k) 



2 

(pTi) 



Here f(k)=i means node k chooses node i and pushes 
gossip pair. In the last step, we used the fact that each k 
appears in the sum for exactly p nodes with same value 
for each node. Now we will take expectation. Let's say 
degree of kth node is dk and we know that node will 
choose a node randomly among its neighbours so 



P[f(k) = i] = 



Plf(k)=f(k)] = 



dt (t) 



( N ) 



N 

p J> f Jl. n 

N N 



(28) 
(29) 



Here P^ k is the probability that node k has degree dk- For 
networks generated by PA Model, 



P(dk) ~ d~ y 



so first we will study \\) for p-push gossip, i.e. when Here y is network exponent. 
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Hence 



i,j,k 

(cnXj-^)P[f(k) = i] 

),k k*k 

(c nlr g -£)p[f(k)*f(k)] 



P*' N ) N (p + l) 2 



P*' jvil"^ NlN' ld < 

;,k,k 

2 y / _ gk\ 2 v \ 
(v + \y L^\ Cn -i- k n) N 

j,k 



+i 1| '" + ( P+ i)2 N ZZh n) 



k 



7 k 



i 

(p + l)2]vE( c »^-|) P 4 



Let us assume that the maximum value of Pa is Pa 
and minimum value is Pd„ lin and the difference of Pd m 
and Pd m .„ is K c , then 



- pTT^" + (» + Wn E E ^ _ E 



y v ! 



I(^-f)"* 



+ (p + i)2 n E E c >^i Pd ™* E °N Pd ™" 



|E cn ''-i Pd ««" E ^ 

(p + i)2 N \ c "-i* n) * 



Applying mass conservation 



1 



+ 1 T " ' (p + l) 2 N 



(p + 1) 2 N 
1 , 



2-K? 



+ 1 T (p + 1) 2 (p + l) 2 N 



1 2-K 2 

< — r t|>» + 



r' + l T " ' (p + 1) 2 

In the last line we use the fact that the minimum value of 



Pi is 0. If we consider the value of y to be 2, maximum 

u min / ' 

value c 
hence, 



value of X 2 will be ^ as minimum value of d is 2 and 



E[% + i | tftj < ^ + (30) 

So if we use the fact that maximum possible value of ij>o 
is N then, 



E[i|>„] < N-(p + l)-" + 
< N • (p + I) - " + 



1 



1 



8 • (p + 1) 8p(p + 1)» +2 
1 



8 ■ (p + 1) 

It can be seen that right hand side of the above equation 
is maximum when p=l. It means potential function is 
decaying at the slowest rate for p=l. So time taken in 
convergence for p=l. For normal push algorithm will 
act as upper bound for differential push algorithm i.e. 
combination of different values of positive integer p's. 
So taking p=l; 

Ety„] < N-2~ n + 1 
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< N-2~ n -k d 



(31) 



Here is an integer constant that is greater then the 
ratio of maximum value of N ■ 2~" and h. This can be 

16 

seen that JQ will depend on the number of steps required 
for convergence. 

Now let us assume h = logN and absolute error £, = 
£ 2 -2~ 2 ". After gossipping for n = logN+logka+logl steps, 

E[\\>„] < t, i.e. \Cn,i,j - f^l < i, -2~" for all nodes i with high 
probability. 

Let us observe about weights. If we consider weight as 
an information to be spread among the nodes, according 
to theorem |V.1| information will reach to all nodes in a 
power law network with high probability in n = log 2 N 
rounds. After these n rounds every node will receive at 
least 2~ n weight. So applying union bound over weight 
and potential event and dividing with g n j results that 
- j}\ < S at steps 0(log 2 N + logN + logk d + log\) i.e. 
within 0(log 2 N + logl) steps with high probability. ■ 

On the basis of these two theorems, it can be seen 
using Triangle Inequality under sum, as in [30], that with 
high probability relative error in average estimation will 
be bounded by £ after 0(log 2 N + log?) gossip steps. 
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Fig. 1. Average absolute change in reputation with different N anda 
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Fig. 2. Network utilisation by a network of 1000 nodes with different 
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VI. Numerical Results 

Performance of algorithm for reputation estimation 
and aggregation for peer to peer file sharing is evaluated 
by simulation as well. A power law network has been 
built using Preferential Attachment model. 

A. Trust Estimation 

Performance of estimation method proposed in this 
paper has been evaluated for 100 and 1000 node net- 



work. Time is considered as slotted. Every slot is termed 
as an iteration. At the start of every slot every node 
queries for some resource and after getting reply it asks 
for resource from replying nodes. Node asks for resource 
two times its download capacity from the nodes with 
resource. Requested nodes allocate their bandwidth as 
per their reputation table. Finally requesting node and 
requested nodes hand shake and transect. At the end 
of slot each node updates its reputation table as per 
the quality of transaction. First 100 iterations have been 
taken as acquittance period i.e. node will allocate their 
bandwidth without referring the reputation table. 

Figure [T] shows the average absolute change in rep- 
utation estimation with increasing number of iterations 
up to 1000. Here,average absolute change in reputation 
(AR)is 

'■i 

RiMtr is the reputation of j for i in ith iteration, d, is the 
degree of ith node, N is the total number of nodes and 
itr is the number of iteration under consideration. 

In figure [T] AR is plotted for an old reputation estima- 
tion technique) 25] and proposed reputation estimation 
method for homogenous and heterogeneous networks 
for 100 and 1000 nodes with a= 0.1,0.01and 0.001. 

In figure [2] the utilisation level of shared resources 
in the network is plotted for homogenous and heteroge- 
neous networks of 1000 nodes for a 0.1 and 0.01. 



B. Trust Aggregation 

Performance of differential algorithm has been evalu- 
ated in terms of number of iterations (to assess the rate 
of convergence) required to converge within a particular 
degree of aggregation error. Number of packets per 
node per gossip step that are required to be transmit- 
ted for convergence have also been calculated, so that 
network overhead can be assessed. Algorithm is also 
tested against collusion and node churn. The simulation 
experiments has been conducted for 100 to 50000 nodes. 

Figure [3] shows the number of gossip steps required 
for different error bounds for different number of nodes. 
This is clearly evident that number of gossip steps is 
increasing with a rate much less then OQog 2 N) i.e. said 
bound is followed for all the cases. 

Figure [4] and [5] are the simulation results against the 
problem of churn. Figure [4] shows the number of gossip 
steps required with different packet loss probability for 
10000 nodes whereas figure [5] shows the number of 
gossip steps required with different number of nodes 
with£, = 0.00001. Peer to peer networks operate above 
TCP layer, i.e. these kind of networks assume a reliable 
bit pipe between sender and receiver. So peer to peer 
network suffers by packet loss only when some node 
leaves the network i.e. due to churning. Here the as- 
sumption is when a node leaves during gossip process, it 
hands over the gossip pair vectors to some other node so 
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4=0.01 


4 = 0.001 


£ = 0.0001 


£, = 0.00001 


N=100 


1.212 


1.203 


1.042 


1.235 


N=500 


1.199 


1.194 


1.125 


1.189 


N=1000 


1.178 


1.159 


1.157 


1.148 


N=10000 


1.156 


1.139 


1.124 


1.122 


N=50000 


1.152 


1.132 


1.119 


1.112 



TABLE I 

Number of Message Per Node per Step Transmitted Due to 
Gossipping 



mass conservation still applies. Whenever a node pushes 
gossip pair to this absent node, the pushing node doesn't 
receive any acknowledgement. In such cases pushing 
node pushes the gossip pair to itself so that mass conser- 
vation still applies. We can see a small increment in the 
number of gossip steps with the increase in the packet 
loss probability. 

Figure [6] and [7] are the simulation results against the 
problem of collusion. Figure [7] plots the average RMS 
error for different percentage of users that are colluding 
individually i.e. they are not forming any group whereas 
figure [6] plots the average RMS error with different 
group sizes of colluding users with different percentage 
of colluding users in the network. 

Here average RMS error is defined as follows, 



Average RMS error = — ^ \ 



N 



(33) 



Here r,j is the reputation of node i at node j com- 
puted by differential gossip in presence of colluding 
nodes, whereas rlj is the reputation computed without 
considering colluding nodes. So, this is clearly evident 
that effect of collusion on reputation computation by 
differential gossip is quite minimum even with very 
high percentage of colluding users. The colluding group 
size is making a small difference in differential gossip 
reputation computation. 

Table [I] shows the number of message transfer re- 
quired by a node in one gossip step. It can be seen that 
this is almost constant and even decreasing a bit with 
the increase in number of nodes. This is happening be- 
cause as number of gossip steps increases the overhead 
incurred in the start gets distributed and a node is less 
burdened as the number of total nodes increases. 

VII. Conclusion 

In peer-to-peer networks free riding is a major prob- 
lem that can be overcome using reputation management 
system. A reputation management system includes two 
processes, first estimation of reputation and second ag- 
gregation of reputation. In this paper we have proposed 
an estimation technique using BLUE and an aggregation 
technique by a modifying push gossip algorithm to 
differential push gossip algorithm. 

The proposed trust estimator consider uncertainties in 
the trust estimation. The average absolute change in trust 
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Fig. 3. Gossip step counts with different number of nodes(N) and 
different error boundsE 
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Fig. 4. Gossip step counts for N=10000 with different error bounds^ 
for different packet loss probability 



values in the trust table of nodes is considerably less 
then older techniques. It employes that reputation can 
be estimated more accurately using this estimator. The 
better estimation of reputation may lead the system to 
a better utilisation of resource as evident from figure [2] 
So it is evident that proposed estimator performs better 
then available techniques. 

The proposed aggregation technique aggregates the 
trust values from different nodes in a power law net- 
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work. This technique does not need the identification 
of power nodes. As identification of power nodes in 
distributed setting is hard, this makes algorithm im- 
plementable. This is robust against churning as can be 
seen in figure [4] and [5] Proposed technique aggregates 
the reputation in a differential manner. This is done by 
considering the feedback of trusted nodes with a higher 
weight. This leads to robustness against collusion as 
evident from figure [6] and U\ 



Proposed algorithm is presented to avoid the problem 
of free riding but this can also be used to avoid malicious 
users in the network with minor changes. 
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